Remapping constant points in a white-box implementation

ABSTRACT

A non-transitory machine-readable storage medium encoded with instructions for execution by a keyed cryptographic operation by a cryptographic system mapping an input message to an output message, wherein the cryptographic operation includes at least one round including a non-linear mapping function configured to map input data to output data, including: instructions for determining that the input data has a diversification number less than a diversification level threshold number; instructions for remapping the input data to a remapped input data, wherein the remapped input data corresponds to an input data having a diversification number greater than or equal to the diversification threshold value, and instructions for inputting the remapped input data into the non-linear mapping function to obtain output data.

TECHNICAL FIELD

Various exemplary embodiments disclosed herein relate generally to securing software components that perform a cryptographic function against attacks, including by remapping constant points in a white-box implementation.

BACKGROUND

The Internet provides users with convenient and ubiquitous access to digital content. Because the Internet is a powerful distribution channel, many user devices strive to directly access the Internet. The user devices may include a personal computer, laptop computer, set-top box, internet enabled media player, mobile telephone, smart phone, tablet, mobile hotspot, or any other device that is capable of accessing the Internet. The use of the Internet as a distribution medium for copyrighted content creates the compelling challenge to secure the interests of the content provider. Increasingly, user devices operate using a processor loaded with suitable software to render (playback) digital content, such as audio and/or video. Control of the playback software is one way to enforce the interests of the content owner including the terms and conditions under which the content may be used. Previously many user devices were closed systems. Today more and more platforms are partially open. Some users may be assumed to have complete control over and access to the hardware and software that provides access to the content and a large amount of time and resources to attack and bypass any content protection mechanisms. As a consequence, content providers must deliver content to legitimate users across a hostile network to a community where not all users or user devices can be trusted.

Secure software applications may be called upon to carry out various functions such as, for example, cryptographic functions used to protect and authenticate digital content. In order to counter attacks, these algorithms have to be obfuscated (hidden) in order to prevent reverse engineering and modification of the algorithm or prohibit obtaining the user-specific secure information. Accordingly, the functions of the secure software application may be carried out by various functions as defined by the instruction set of the processor implementing the secure software. For example, one way to obscure these functions is by the use of lookup tables.

Content providers must deliver content to legitimate users across a hostile network to a community where not all users or devices can be trusted. This has led to the development of white-box cryptography. In the white-box cryptography scenario it is assumed that the user has complete control of the hardware and software that provides access to the content, and an unlimited amount of time and resources to attack and bypass any content protection mechanisms. The secure software code that enforces the terms and conditions under which the content may be used should be tamper resistant. Digital rights management is a common application of secure software applications. The general approach in digital rights management for protected content distributed to user devices is to encrypt the digital content using, for example, DES (Data Encryption Standard), AES (Advanced Encryption Standard), or other known encryption schemes, and to use decryption keys to recover the digital content. These decryption keys must be protected to prevent unauthorized access to protected material.

In the digital rights management scenario, the attacker has complete control of the software enforcing the management and access to the protected content. Accordingly, the attacker can modify software and also seek to obtain cryptographic keys used to encrypt the protected content. Such keys may be found by analyzing the software.

Regarding key distribution, a media player has to retrieve a decryption key from a license database in order to play back the media. The media player then has to store this decryption key somewhere in memory for the decryption of the encrypted content. This leaves an attacker two options for an attack on the key. First, an attacker may reverse engineer the license database access function allowing the attacker to retrieve asset keys from all license databases. In this situation the attacker does not need to understand the internal working of the cryptographic function. Second, the attacker may observe accesses of the memory during content decryption, thus the attacker may retrieve the decryption key. In both cases the key is considered to be compromised.

The widespread use of digital rights management (DRM) and other secure software has given rise to the need for secure, tamper-resistant software that seeks to complicate tampering with the software. Various techniques for increasing the tamper resistance of software applications exist. Most of these techniques are based on hiding the embedded knowledge of the application by adding a veil of randomness and complexity in both the control and the data path of the software application. The idea behind this is that it becomes more difficult to extract information merely by code inspection. It is therefore more difficult to find the code that, for example, handles access and permission control of the secure application, and consequently to change it.

As used herein, white-box cryptography includes a secure software application that performs cryptographic functions in an environment where an attacker has complete control of the system running the white-box cryptography software. Thus, the attacker can modify inputs and outputs, track the operations of the software, sample and monitor memory used by the software at any time, and even modify the software. Accordingly, the secure functions need to be carried out in a manner that prevents the disclosure of secret information used in the secure functionality. White-box cryptography functions may be implemented in various ways. Such methods include: obscuring the software code; using complex mathematical functions that obscure the use of the secret information; using look-up tables; using finite state machines; or any other methods that carry out cryptographic functions but hide the secret information needed for those secure functions. A white-box implementation may also contain components that include anti-debugging and tamper-proofing properties.

There are several reasons for preferring a software implementation of a cryptographic algorithm to a hardware implementation. This may, for instance, be the case because a software solution is renewable if the keys leak out, because it has lower cost, or because the application-developer has no influence on the hardware where the white-box system is implemented.

SUMMARY

A brief summary of various exemplary embodiments is presented below. Some simplifications and omissions may be made in the following summary, which is intended to highlight and introduce some aspects of the various exemplary embodiments, but not to limit the scope of the invention. Detailed descriptions of an exemplary embodiment adequate to allow those of ordinary skill in the art to make and use the inventive concepts will follow in later sections.

Various exemplary embodiments relate to a non-transitory machine-readable storage medium encoded with instructions for execution by a keyed cryptographic operation by a cryptographic system mapping an input message to an output message, wherein the cryptographic operation includes at least one round including a non-linear mapping function configured to map input data to output data, including: instructions for determining that the input data has a diversification number less than a diversification level threshold number; instructions for remapping the input data to a remapped input data, wherein the remapped input data corresponds to an input data having a diversification number greater than or equal to the diversification threshold value, and instructions for inputting the remapped input data into the non-linear mapping function to obtain output data.

Various embodiments are described further including instructions for splitting, by the cryptographic system, the input data into n split input data, wherein the splitting of the input data varies based upon the value of the input message; and instructions for inputting each split input data into the non-linear mapping function to obtain n split output data, wherein a combination of the n split output data indicates an output data, wherein the output data results when the input data is input to the non-linear mapping function.

Various embodiments are described further comprising: instructions for encoding the output data according to a selected one of a plurality of encoding schemes, wherein the encoding scheme is selected out of a plurality of encoding schemes based upon selection data which depends upon the input message; instructions for receiving in a next round of the cryptographic operation the encoder output as input; and instructions for compensating for the effect of the encoding according to a selected one of a plurality of recoding schemes based upon the selected recoding scheme out of the plurality of recoding schemes based upon the selection data.

Various embodiments are described wherein instructions for inputting each split input data into the non-linear mapping function includes instructions for inputting each split input data into a plurality of split mapping functions wherein XORing the plurality of split mapping functions results in the non-linear mapping function.

Various embodiments are described wherein the instructions for determining and remapping are associated with a current round of the keyed cryptographic operation producing an output of the current round that is the input to a next round of the keyed cryptographic operation and comprising in the next round compensating for the remapping of the output data.

Various embodiments are described wherein compensating for the remapping of the output data includes receiving an indication of the input value that was remapped.

Various embodiments are described wherein the cryptographic operation is an Advanced Encryption Standard (AES) operation and wherein compensating for the remapping of the input data is based upon MC, S(0), and S(C), where MC is the mix columns function, S(0) is the value of the AES substitution box for input of 0, and S(C) is the value of the AES substitution box for input of C, where the remapped output is set to the value of C.

Various embodiments are described wherein determining that the input data has a diversification number less than a diversification level threshold value includes determining that the input data corresponds to a constant point associated with the non-linear mapping function and wherein remapping the input data to a remapped input data includes setting the input data to a value C.

Various embodiments are described wherein the cryptographic operation is an Advanced Encryption Standard (AES) operation and wherein the constant point is 0.

Various embodiments are described wherein lookup tables implement the keyed cryptographic operation.

Various embodiments are described wherein finite state machines implement the keyed cryptographic operation.

Further, various exemplary embodiments relate to a method of producing a cryptographic implementation of a cryptographic operation mapping an input message to an output message, wherein the cryptographic operation includes at least one round including a non-linear mapping function configured to map input data to output data, comprising: producing a cryptographic implementation of the cryptographic operation wherein the cryptographic implementation is configured to: determine that the input data has a diversification number less than a diversification level threshold number; remap the input data to a remapped input data, wherein the remapped input data corresponds to an input data having a diversification number greater than or equal to the diversification threshold value, and input the remapped input data into the non-linear mapping function to obtain output data.

Various embodiments are described wherein the cryptographic implementation is further configured to: split, by the cryptographic system, the input data into n split input data, wherein the splitting of the input data varies based upon the value of the input message; and input each split input data into the non-linear mapping function to obtain n split output data, wherein a combination of the n split output data indicates an output data, wherein the output data results when the input data is input to the non-linear mapping function.

Various embodiments are described wherein the cryptographic implementation is further configured to: encode the output data according to a selected one of a plurality of encoding schemes, wherein the encoding scheme is selected out of a plurality of encoding schemes based upon selection data which depends upon the input message; receive in a next round of the cryptographic operation the encoder output as input; and compensate for the effect of the encoding according to a selected one of a plurality of recoding schemes based upon the selected recoding scheme out of the plurality of recoding schemes based upon the selection data.

Various embodiments are described wherein inputting each split input data into the non-linear mapping function includes inputting each split input data into a plurality of split mapping functions wherein XORing the plurality of split mapping functions results in the non-linear mapping function.

Various embodiments are described wherein determining and remapping are associated with a current round of the keyed cryptographic operation producing an output of the current round that is the input to a next round of the keyed cryptographic operation and comprising in the next round compensating for the remapping of the output data.

Various embodiments are described wherein compensating for the remapping of the output data includes receiving an indication of the input value that was remapped.

Various embodiments are described wherein the cryptographic operation is an Advanced Encryption Standard (AES) operation and wherein compensating for the remapping of the input data is based upon MC, S(0), and S(C), where MC is the mix columns function, S(0) is the value of the AES substitution box for input of 0, and S(C) is the value of the AES substitution box for input of C, where the remapped output is set to the value of C.

Various embodiments are described wherein determining that the input data has a diversification number less than a diversification level threshold value includes determining that the input data corresponds to a constant point associated with the non-linear mapping function and wherein remapping the input data to a remapped input data includes setting the input data to a value C.

Various embodiments are described wherein the cryptographic operation is an Advanced Encryption Standard (AES) operation and wherein the constant point is 0.

Various embodiments are described wherein lookup tables implement the keyed cryptographic operation.

Various embodiments are described wherein finite state machines implement the keyed cryptographic operation.

BRIEF DESCRIPTION OF THE DRAWINGS

In order to better understand various exemplary embodiments, reference is made to the accompanying drawings, wherein:

FIG. 1 illustrates the main steps of a round of AES;

FIG. 2 illustrates a white-box AES implementation with fixed encodings on the input of the rounds;

FIG. 3 illustrates the computation of one output nibble by means of a network of look-up tables;

FIG. 4 illustrates a portion of the network table of FIG. 3 obfuscated by encoding the inputs and outputs;

FIG. 5 shows how c_(i)⁻¹·y_(j,3,2) may be computed for i≠j≠4;

FIG. 6 shows how c_(i)⁻¹·y_(j,3,2) may be computed for i≠j and i≠4;

FIG. 7 illustrates a XOR network used to calculate c₄;

FIG. 8 illustrates determining if an output z_(k,j) equals 0 that may be implemented by a small table network;

FIG. 9 illustrates remapping the 0 output to a different value; and

FIG. 10 illustrates the compensation for remapping the 0 value in the next round.

To facilitate understanding, identical reference numerals have been used to designate elements having substantially the same or similar structure and/or substantially the same or similar function.

DETAILED DESCRIPTION

The description and drawings illustrate the principles of the invention. It will thus be appreciated that those skilled in the art will be able to devise various arrangements that, although not explicitly described or shown herein, embody the principles of the invention and are included within its scope. Furthermore, all examples recited herein are principally intended expressly to be for pedagogical purposes to aid the reader in understanding the principles of the invention and the concepts contributed by the inventor(s) to furthering the art, and are to be construed as being without limitation to such specifically recited examples and conditions. Additionally, the term, “or,” as used herein, refers to a non-exclusive or (i.e., and/or), unless otherwise indicated (e.g., “or else” or “or in the alternative”). Also, the various embodiments described herein are not necessarily mutually exclusive, as some embodiments can be combined with one or more other embodiments to form new embodiments.

There are several reasons for preferring a software implementation of a cryptographic algorithm to a hardware implementation. This may, for instance, be the case because a software solution is renewable if the keys leak out, because it has lower cost, or because the application-developer has no influence on the hardware where the white-box system is implemented. While the description of embodiments below is directed to a software implementation running on a processor, it is noted that these embodiments may also be partially or completely implemented in hardware. The lookup tables and finite state machines that are described may be implemented in hardware to carry out the various functions described.

Table-based approaches to white-box implementations of the Advanced Encryption Standard (AES) and the Data Encryption Standard (DES) were proposed in the following papers: “White-Box Cryptography and an AES Implementation”, by Stanley Chow, Philip Eisen, Harold Johnson, and Paul C. Van Oorschot, in Selected Areas in Cryptography: 9th Annual International Workshop, SAC 2002, St. John's, Newfoundland, Canada, Aug. 15-16, 2002, referred to hereinafter as “Chow 1”; and “A White-Box DES Implementation for DRM Applications”, by Stanley Chow, Phil Eisen, Harold Johnson, and Paul C. van Oorschot, in Digital Rights Management: ACM CCS-9 Workshop, DRM 2002, Washington, D.C., USA, Nov. 18, 2002, referred to hereinafter as “Chow 2”. Chow 1 and Chow 2 disclose methods of using a table-based approach to hide the cryptographic key by a combination of encoding its tables with random bijections, and extending the cryptographic boundary by pushing it out further into the containing application.

However, a weakness in the approach of Chow is pointed out in a paper by Olivier Billet, Henri Gilbert, and Charaf Ech-Chatbi, “Cryptanalysis of a White Box AES Implementation,” in Helena Handschuh and M. Anwar Hasan, editors, Selected Areas in Cryptography, volume 3357 of Lecture Notes in Computer Science, pages 227-240, Springer, 2005, referred to hereinafter as “Billet”. This weakness may be exploited by an attacker, and may in a worst case result in revealing the secret key hidden in the white-box implementation.

The key observation by Billet is that the individual input words of the respective rounds in a white box implementation according to Chow are in a particular relation to corresponding input words in an ordinary non-white-box implementation. This relation can be expressed without reference to other input values than the individual input value. Although this relation is unknown to the attacker, this feature provides enough structure to substantially simplify the breaking of the white-box, as further explained in Billet.

Accordingly, it would be advantageous to have an improved white-box cryptographic system for performing a cryptographic operation which maps an input-message to an output-message that would counter the attack identified in Billet.

One reason why the construction of Chow is vulnerable is the fixed relation that exists between individual input values of respective rounds in the white-box implementation and individual input values in the ordinary non-white-box implementation. In the embodiments of the invention described below the splitting of an input byte x into n bytes α₁(x), α₂(x), . . . , α_(n)(x) using a series of functions α_(n) is introduced. The functions α_(n) are selected so that when each of the n bytes passes through the S-box, the results may be combined to achieve the desired result of passing the original byte x through the S-box. The specific α_(n) functions used to split the input bytes are based upon the input, so they vary as the input varies. This feature breaks the fixed relationship exploited by the Billet attack. The split inputs are no longer in a fixed relationship with some individual input value in the non-white box implementation of the cryptographic operation. This complication is enough to foil the attack laid out in Billet.

By replacing the fixed encoding of values with splitting of the values, reverse engineering of the cryptographic system becomes harder, as it is harder for the reverse engineer to compare the working of the cryptographic system according to the invention with the workings of a non-white-box version of the cryptographic operation.

As noted, for many cryptographic operations it is desired to have a white-box implementation. The invention may be applied, for example, to symmetric and asymmetric cryptographic operations. Also, the invention may be applied to block ciphers, stream ciphers, message authentication schemes, signature schemes, etc. Note that the invention may also be applied to hash functions. The latter is especially useful if the hash function is used as a building block which processes secret information, e.g., a secret key, secret data, etc. For example, the invention may be applied to a hash function used in a keyed-Hash Message Authentication Code (HMAC or KHMAC). Well known block ciphers include: Advanced Encryption Standard (AES), Secure And Fast Encryption Routine, (SAFER, and variants SAFER+ and SAFER++), Blowfish, Data Encryption Standard (DES), etc. A well known stream cipher is RC4. Moreover any block cipher can be used as stream cipher using an appropriate mode of operation, e.g., Cipher feedback (CFB), Counter mode (CTR), etc.

The input message can represent, e.g., encrypted content data, such as multi-media data, including audio and/or video data. The encrypted content data may also include encrypted software, e.g., encrypted computer code representing some computer application, e.g., a computer game, or an office application. The input message may also represent a key for use in a further cryptographic operation. The latter may be used, for example, in a key exchange protocol, wherein a white-box implementation according to the invention encrypts and/or decrypts data representing a new key. The input data may also be plain data, for example, plain user data. The latter is especially advantageous in message authentication schemes. A white-box implementation according to the invention may have the property that the implementation may only be used for encryption or only be used for decryption, but not for both. For example, this property can be achieved if the implementation uses look-up tables which are not bijective, for example, a look-up table having more input bits than output bits. Accordingly, if a user only has a white-box decryptor, he may verify a MAC code but not create new MACs. This strengthens the non-repudiation properties of such a message authentication scheme.

The white-box implementation may be implemented using a plurality of basic blocks. The plurality of basic blocks is interconnected, in the sense that some of the blocks build on the outputs of one or more of the previous blocks. A basic block may be implemented in hardware, for example, as a computer chip. A basic block may use a switch board, a state machine or any other suitable construction for implementing functions in computer hardware. A basic block may also be implemented in software running on a general purpose computer chip, e.g. a microprocessor. For example, a basic block may use a plurality of computer instructions, including arithmetical instructions, which together implement the functionality of the basic block. A widely used implementation for the basic block, which may be used both in software and hardware, is a look-up table. For example, Chow 1 and Chow 2 take this approach to implement the AES and DES block ciphers. A look-up table implementation includes a list which lists for possible input values, an output value. The input value may be explicit in the lookup table. In that situation the look-up table implementation could map a particular input to a particular output by searching in the list of input values for the particular input. When the particular input is found the particular output is then also found. For example, the particular output may be stored alongside the particular input. Preferably, the input values are not stored explicitly, but only implicitly. For example, if the possible inputs are a consecutive range, e.g. of numbers or bit-strings, the look-up table may be restricted to storing a list of the output values. A particular input number may, e.g., be mapped to the particular output which is stored at a location indicated by the number. Further, finite state machines or code obfuscation may be used to implement the white-box implementation.
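As an illustration of the implicit-input form of a look-up table, the following is a minimal sketch in which the position of an entry in the list serves as the input value, so that only the outputs are stored. The function `toy_func` and the 4-bit width are hypothetical and chosen only for illustration; they do not appear in Chow 1 or Chow 2.

```python
def build_table(func, n_bits):
    """Tabulate func over every n_bits-wide input; the list index is the input."""
    return [func(x) for x in range(1 << n_bits)]


def toy_func(x):
    """Hypothetical 4-bit function used only to have something to tabulate."""
    return (x * 7 + 3) % 16


# Only the outputs are stored; the inputs 0..15 are implicit in the positions.
TABLE = build_table(toy_func, 4)


def lookup(x):
    """Evaluate the basic block by indexing, without recomputing toy_func."""
    return TABLE[x]
```

Once the table has been built, the defining function no longer needs to be present in the implementation, which is what makes the table a convenient basic block.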

For example, a look-up table for a function may be created by computing the output value of the function for its possible inputs and storing the outputs in a list. If the function depends on multiple inputs, the outputs may be computed and stored for all possible combinations of the multiple inputs. Look-up tables are especially suited to implement non-linear functions, which map inputs to outputs in irregular ways. A white-box implementation can be further obfuscated, as is explained below, by applying to one or more of its look-up tables a fixed obfuscating input encoding and a fixed output encoding. The result of applying a fixed obfuscating input encoding and output encoding is then fully pre-evaluated. Using this technique, a look-up table would be replaced by an obfuscated look-up table which has the same dimensions, that is, it takes the same number of input bits and produces the same number of output bits. The input encoding and output encoding used in such obfuscation are not explicit in the final white-box implementation. A better obfuscation is achieved in the embodiments of the invention described below, which introduce splitting of input bytes in a manner which is not fixed but rather depends on the input. It is noted that the input splitting described herein may be combined with traditional obfuscation techniques to advantage, as together they further obscure the inner workings of the cryptographic operation.
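The fixed input and output encodings described above may be sketched as follows. This is a minimal illustration only: the table `T`, the seeded random bijections `f` and `g`, and all helper names are hypothetical and are not taken from Chow 1 or Chow 2.

```python
import random


def random_bijection(size, rng):
    """Return a random permutation of 0..size-1, stored as a list."""
    perm = list(range(size))
    rng.shuffle(perm)
    return perm


def invert(perm):
    """Return the inverse permutation."""
    inv = [0] * len(perm)
    for i, v in enumerate(perm):
        inv[v] = i
    return inv


rng = random.Random(1)  # fixed seed so the sketch is reproducible

# Hypothetical 8-bit table standing in for a round table.
T = [(x * x + 1) % 256 for x in range(256)]

f = random_bijection(256, rng)  # fixed input encoding
g = random_bijection(256, rng)  # fixed output encoding
f_inv = invert(f)

# Fully pre-evaluated obfuscated table: T_obf = g o T o f^-1.
# It has the same dimensions as T, and neither f nor g is stored with it.
T_obf = [g[T[f_inv[v]]] for v in range(256)]
```

Presenting the encoded input f(x) to `T_obf` yields the encoded output g(T(x)). Because f and g are the same in every execution, this is the fixed relation that the embodiments described herein seek to avoid.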

The network of basic blocks is arranged to compute an output message when it is presented with an input message. Typically, the input message is operated upon by a number of basic input blocks. A number of further basic blocks may take input from one or more of the basic input blocks and/or from the input. Yet further basic blocks can take input from any combination of the input message, the output of basic input blocks and the output of the further basic blocks. Finally some set of basic exit blocks, i.e., at least one, produce as output all or part of the output-message. In this manner a network of basic blocks emerges which collectively computes the mapping from the input message to output message.

The key used may be a cryptographic key and may contain sufficient entropy to withstand an anticipated brute force attack. It is noted that in a white-box implementation, the key is typically not explicitly present in the implementation. This would risk the key being found by inspection of the implementation. Typically, the key is only present implicitly. Various ways are known to hide a key in a cryptographic system. Typically, at least the method of partial evaluation is used, wherein a basic block which needs key input is evaluated in-so-far that it does not depend on the input-message. For example, a basic operation wherein an input-value, a masking value, which does not depend on the input-message, e.g. a value from an S-box, and a key-value need to be XORed can be partially evaluated by XORing the key value and the masking value together beforehand. In this way the operation still depends on the key-value although the key-value is not explicitly present in the implementation. Instead, only the XOR between the key-value and masking-value is present in the implementation. Note that, more complicated ways and/or further ways of hiding the keys are compatible with embodiments of this invention.
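The partial evaluation of an XOR with a key value may be sketched as follows. The byte values `KEY_BYTE` and `MASK_BYTE` are hypothetical and chosen only for illustration.

```python
KEY_BYTE = 0x3C   # hypothetical key value; absent from the shipped code
MASK_BYTE = 0xA5  # hypothetical masking value independent of the input message


def reference(x):
    """Ordinary computation in which the key value appears explicitly."""
    return x ^ MASK_BYTE ^ KEY_BYTE


# Computed once when the implementation is generated. Only this combined
# constant would be present in the white-box implementation.
KEY_XOR_MASK = KEY_BYTE ^ MASK_BYTE


def partially_evaluated(x):
    """Same result as reference(), using only the pre-combined constant."""
    return x ^ KEY_XOR_MASK
```

The operation still depends on the key value, but an inspection of the implementation reveals only the XOR of the key value and the masking value.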

Various splitting functions α_(n) may be used with the invention. A splitting function α_(n) provides a different way to represent a data value. Typically, the splitting function α_(n) is also bijective, although this is not necessary.

A set of self-equivalent splitting functions may be used to split the input. For example, if x is an input to an S-box and y is the output, then a pair of functions α and β are self equivalent if an input α_(n)(x) to the S-box results in an output β_(n)(y). Such functions provide benefit in splitting the input to the S-box, for if the β functions are selected so that their combination results in the output value y or a known function of y, then when the equivalent α functions are used to split the input, it is easy to obtain the desired output from the S-box while breaking the fixed encoding exploited by the Billet attack.
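One way such a split could work for the AES S-box is sketched below, under stated assumptions. The sketch assumes n=3 and splitting functions of the form α(x)=c·x (multiplication in GF(2⁸) by a nonzero constant), with the three constants chosen so that their inverses XOR to 1; under that assumption the XOR of the three S-box outputs equals S(x). The selectors `d1` and `d2` and all helper names are hypothetical, and this construction is an illustration rather than the specific splitting of the embodiments.

```python
def gmul(a, b):
    """Multiply in GF(2^8) modulo the AES polynomial x^8+x^4+x^3+x+1."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        a <<= 1
        if a & 0x100:
            a ^= 0x11B
        b >>= 1
    return r


def ginv(a):
    """Inverse in GF(2^8) computed as a^254; 0 is mapped to 0."""
    r = 1
    for _ in range(254):
        r = gmul(r, a)
    return r


def rotl8(z, k):
    """Rotate an 8-bit value left by k bits."""
    return ((z << k) | (z >> (8 - k))) & 0xFF


def sbox(x):
    """AES S-box: field inversion followed by the AES affine map."""
    z = ginv(x)
    return z ^ rotl8(z, 1) ^ rotl8(z, 2) ^ rotl8(z, 3) ^ rotl8(z, 4) ^ 0x63


SBOX = [sbox(x) for x in range(256)]


def split(x, d1, d2):
    """Split x into three bytes c_i*x, where c_i is the inverse of d_i.

    d1 and d2 are hypothetical selectors; in an embodiment they would be
    derived from the input message. d3 is fixed by d1 ^ d2 ^ d3 == 1.
    """
    d3 = 1 ^ d1 ^ d2
    if 0 in (d1, d2, d3):
        raise ValueError("all three selectors must be nonzero")
    return [gmul(ginv(d), x) for d in (d1, d2, d3)]


def sbox_via_split(x, d1, d2):
    """Pass each split byte through the S-box and XOR the three outputs."""
    y = 0
    for part in split(x, d1, d2):
        y ^= SBOX[part]
    return y
```

For example, `sbox_via_split(x, 0x02, 0x05)` and `sbox_via_split(x, 0x1D, 0x80)` both return `SBOX[x]`, although the bytes actually presented to the S-box differ between the two calls. The input x=0 is the exception, since it is split into three zero bytes regardless of the selectors.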

Determining affine self-equivalent splitting functions may be done in any suitable manner, for example, as described in the paper “A Toolbox for Cryptanalysis: Linear and Affine Equivalence Algorithms,” by A. Biryukov, C. De Canniere, A. Braeken, and B. Preneel, Proceedings of Eurocrypt, 2003, pp. 33-50, referred to hereinafter as Biryukov. In Biryukov, methods are described for determining affine self-equivalent functions for various encryption methods.

US Patent Publication No. 2012/0002807 to Michiels (“Michiels”), which is hereby incorporated herein for all purposes, provides another solution to overcome the attack on fixed encodings described by Billet. In Michiels, variable encodings are put on the S-box inputs. If (α, β) is an affine self-equivalence of the AES S-box, then giving the S-box α(x) as input results in output β(y), if S(x)=y. Let the affine self-equivalences be numbered from 1 onwards. Then, during execution an α_(i)-encoding can be put on the S-box input, where the choice of index i (i.e., the choice of the affine self-equivalence) is computation dependent. In an additional computation track, the value i (in encoded form) is kept track of. After the S-box, this value i is used to compensate for the variable encoding that has been put on the S-box input.
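The self-equivalence relation and the subsequent compensation may be checked with the following minimal sketch. It assumes one family of affine self-equivalences of the AES S-box, namely α_c(x)=c·x with β_c(y)=A(c⁻¹·A⁻¹(y)), where A is the AES affine map and c is a nonzero field element playing the role of the index i. The function names are hypothetical, and the sketch is not the implementation of Michiels.

```python
def gmul(a, b):
    """Multiply in GF(2^8) modulo the AES polynomial x^8+x^4+x^3+x+1."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        a <<= 1
        if a & 0x100:
            a ^= 0x11B
        b >>= 1
    return r


def rotl8(z, k):
    """Rotate an 8-bit value left by k bits."""
    return ((z << k) | (z >> (8 - k))) & 0xFF


def affine(z):
    """AES affine map A applied after field inversion in the S-box."""
    return z ^ rotl8(z, 1) ^ rotl8(z, 2) ^ rotl8(z, 3) ^ rotl8(z, 4) ^ 0x63


# Field inverses found by search; 0 is mapped to 0 as in AES.
INV = [0] * 256
for a in range(1, 256):
    INV[a] = next(b for b in range(1, 256) if gmul(a, b) == 1)

AFFINE = [affine(z) for z in range(256)]
AFFINE_INV = [0] * 256
for z, w in enumerate(AFFINE):
    AFFINE_INV[w] = z

SBOX = [AFFINE[INV[x]] for x in range(256)]


def alpha(c, x):
    """Variable input encoding selected by c (c must be nonzero)."""
    return gmul(c, x)


def beta(c, y):
    """Output map paired with alpha: S(alpha(c, x)) == beta(c, S(x))."""
    return AFFINE[gmul(INV[c], AFFINE_INV[y])]


def compensate(c, w):
    """Undo beta after the S-box, using the tracked selector c."""
    return AFFINE[gmul(c, AFFINE_INV[w])]
```

With these definitions, `SBOX[alpha(c, x)] == beta(c, SBOX[x])` holds for every byte x and every nonzero c, and `compensate(c, SBOX[alpha(c, x)])` recovers `SBOX[x]`.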

Essential for both approaches to overcoming the Billet attack is that if in two different executions the input x to an S-box is the same, the identical input values may be mapped to different values in the white-box implementation. To satisfy this property for all inputs, the set V₁ may not have a constant point x, i.e., a point for which ƒ(x) is the same for all functions ƒ∈V₁. For AES, however, the set V of self-equivalences is such that 0 is a constant point of V₁.

The fixed encodings on the input and output bytes of the round cause a vulnerability exploited by the known attacks. Both approaches to overcome Billet are based on the fact that a set V₁ of functions is constructed, such that the input bytes x_(i) are hidden in intermediate values ƒ(x_(i)), where ƒ is taken from V₁. Here, the function ƒ is not always the same. That is, in different executions, a different function from V₁ may be taken. Essential in the approaches is that a value x_(i) can be mapped to different values, depending on the function ƒ that is selected.

Let U be the set of constant points for V₁. That is, U={x|∀ƒ,g∈V₁: ƒ(x)=g(x)}. Then, for the points in U both approaches still correspond to the fixed encoding problem. For AES, the set V₁ as proposed in the embodiments of both approaches is defined by V=V₁×V₂ where V is the set of affine self-equivalences of an S-box. This set V₁ has 0 as constant point.

Now, the essential feature of the embodiment described herein is that the constant points are mapped to non-constant points. Typically this reduces the domain of the table hiding the S-box. More precisely, this means that the table (or, in the case of, e.g., a finite-state-machine approach to white-boxing, the finite state machine) that hides an n-bit S-box has as input a value from the set 2^(n)−U, i.e., all n-bit values except those in U, instead of the full set 2^(n).

This can be realized as follows. Before the table hiding an S-box S, the value ƒ∘g(x) is computed instead of ƒ(x) where g maps x to itself if x∉U and where g maps x to an arbitrary different value outside U if x∈U. After the (hidden) S-box operation, this possible remapping of x is undone as explained in the embodiments below.
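The remapping g can be illustrated with a minimal Python sketch. It assumes, for illustration only, that V₁ consists of the multiplications x ↦ c⁻¹·x in GF(2⁸) with c≠0 (the form used in the AES embodiments), and that the replacement value is C=1; the names `gf_mul`, `INV`, `constant_points`, and `g` are hypothetical helpers and are not taken from the described implementation.

```python
def gf_mul(a, b):
    """Multiply two bytes in GF(2^8) modulo the AES polynomial 0x11B."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        a <<= 1
        if a & 0x100:
            a ^= 0x11B
        b >>= 1
    return r

# INV[a] is the multiplicative inverse of a, with INV[0] = 0 by convention.
INV = [0] + [next(b for b in range(1, 256) if gf_mul(a, b) == 1)
             for a in range(1, 256)]

# Assumption: V1 is modeled as the maps x -> c^-1 * x for all nonzero c.
V1 = [lambda x, ci=INV[c]: gf_mul(ci, x) for c in range(1, 256)]

def constant_points(funcs):
    """U = {x | f(x) = g(x) for all f, g in funcs}."""
    return {x for x in range(256) if len({f(x) for f in funcs}) == 1}

U = constant_points(V1)
C = 0x01  # hypothetical replacement value outside U

def g(x):
    """Identity outside U; a point inside U is moved to C."""
    return C if x in U else x

# After remapping, the former constant point is spread over many values.
images_of_zero = {f(g(0)) for f in V1}
print(sorted(U), len(images_of_zero))
```

Because g sends both 0 and C to C, it is not injective, which is why the remapping has to be undone or compensated for after the hidden S-box operation.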

First, an embodiment that splits the inputs into a number of chunks is described. Then it is described how to further extend that embodiment so that any constant points are mapped to non-constant points.

Below exemplary embodiments are described using the AES (Advanced Encryption Standard) block cipher, because AES has become a widely used standard for block ciphers. AES is a block cipher with a block size of 128 bits or 16 bytes. The plaintext is divided in blocks of 16 bytes which form the initial state of the encryption algorithm, and the final state of the encryption algorithm is the cipher text. At any given point in the encryption algorithm these 16 bytes are the state of the encryption algorithm. To conceptually explain AES, the bytes of the state are organized as a matrix of 4×4 bytes. AES includes a number of rounds, which depend on the key size. Each round includes similar processing steps operating on bytes, rows, or columns of the state matrix, each round using a different round key in these processing steps. In the discussion using AES as an example, it is noted that AES defines a round in a specific manner. In the embodiments below, a round is any grouping of steps that includes at least one non-linear mapping function, such as an S-box in AES. Accordingly, a round as described below includes one non-linear mapping function and any combination of other steps of the cryptographic function. Further, the boundary of the round may start with the non-linear mapping function, for example an S-box, or any other operation that may be merged with the non-linear mapping function, for example a key addition.

FIG. 1 illustrates some main processing steps of a round of AES. The processing steps include:

AddRoundKey 110—each byte of the state is XORed with a byte of the round key;

SubBytes 120—a byte-to-byte permutation using a lookup table;

ShiftRows 130—each row of the state is rotated a fixed number of bytes; and

MixColumns 140—each column is processed using a modular multiplication in GF(2⁸).

The steps SubBytes 120, ShiftRows 130, and MixColumns 140 are independent of the particular key used. The key is applied in the step AddRoundKey 110. Except for the step ShiftRows 130, the processing steps can be performed on each column of the 4×4 state matrix without knowledge of the other columns. Therefore, they can be regarded as 32-bit operations as each column consists of four 8-bit values. Dashed line 150 indicates that the process is repeated until the required number of rounds has been performed.

Each of these steps or a combination of steps may be represented by a lookup table or by a network of lookup tables. If the AddRoundKey 110 step is implemented by XORing with the round key, then the key is visible to the attacker in the white-box attack context. The AddRoundKey 110 step can also be embedded in lookup tables, which makes it less obvious to find out the key. In fact, it is possible to replace a full round of AES by a network of lookup tables. For example, the SubBytes 120, ShiftRows 130, and MixColumns 140 steps may be implemented using table lookups. Below, a possible white-box implementation of AES is discussed in sufficient detail to describe the embodiments of the invention, but further detailed descriptions of such an implementation are found in Chow 1. Also, other variations in the lookup table implementation may be used which are within the scope of the invention.

Embodiments described below provide a different approach for implementing white-box implementations that do not have the weakness as described in Billet of having a fixed encoding on the S-box inputs. This fundamental weakness does not only apply to the table-based approach of Chow 1 and Chow 2, it also applies to finite state machine approaches and the approach of applying generic code transformation techniques to securely implement a cryptographic cipher. Also, for these approaches, the techniques of the embodiments of the invention described below may be used.

Both the table-based white-box implementations and the finite state machine implementations have the property that all intermediate values in the implementation are encoded (as compared to a standard implementation). Examples of white-box implementations using finite state machines are disclosed in U.S. Patent Publication 2007/0014394 entitled “Data Processing Method” and a presentation at the Re-trust Sixth Quarterly Meeting entitled “Synchrosoft MCFACT™ Secure Data Processing Technology” by Wulf Harder and Atis Straujums dated Mar. 11, 2008, which each are hereby incorporated by reference for all purposes as if fully set forth herein. However, Billet shows that the implementation can be broken (i.e., the cryptographic key can be extracted) if the input to S-boxes is encoded by fixed encodings. FIG. 2 illustrates a white-box AES implementation with fixed encodings on the input of the rounds, i.e., on the input of the S-boxes. As shown, each of the 16 input bytes is encoded by f_(i) and each of the output bytes is encoded by g_(i).

As indicated, the white-box implementation illustrated in FIG. 2 may be broken due to the fixed encodings on the input (and output) bytes. This problem may be solved by splitting each input byte into n bytes, for some n>1. This split may be accomplished in a non-fixed way. Further, in some embodiments of the invention, in 2 independent runs of the white-box implementation, if the input byte x_(i) to the round is the same, the input byte x_(i) may be split into different n-byte-tuples. This implies that, for an attacker observing individual bytes in this tuple, the application of the splitting functions is not a fixed encoding of x_(i). This protects against all known attacks.

Presented below is a more precise and formal description of the splitting of the input bytes.

First, let V be the set of affine self-equivalences of an S-box, i.e., V={(α,β)|S=β∘S∘α⁻¹ with α, β affine}.

Next, let the set W contain n pairs from V as follows, where h is an affine bijective function: W ⊂{Γ=(Γ₁,Γ₂, . . . ,Γ_(n))∈V ^(n)|Γ_(i)=(α_(i),β_(i)) with ∀_(x)⊕_(i)β_(i)(x)=h(x)}. The function h(x) may be a fixed affine function, such as the identity function, but can also be an affine function selected by the computation. For example, in one embodiment described below, the function β_(i)(x)=c_(i)x where c_(n)=1⊕(⊕_(i=1) ^(n-1)c_(i)) and c_(i), for i<n, depends upon the input state. Alternatively, in another embodiment, ⊕_(i=1) ^(n)β_(i)(x)=h(x) for function h being given by the function β (possibly up to a constant addition) with (α,β)∈V that is associated with multiplication coefficient c=⊕_(i=1) ^(n)c_(i). This results in an example where h is dependent on a computation based upon the input state.

For an input byte x, n copies of the S-box are fed with the inputs α₁(x), α₂(x), . . . , α_(n)(x) for any Γ∈W. This results in the S-box outputs β₁(y), β₂(y), . . . , β_(n)(y). XORing the S-box outputs β₁(y), β₂(y), . . . , β_(n)(y) results in the correct S-box output y=S(x) up to an affine mapping h. Generally, the XOR is not calculated explicitly in order to minimize the opportunity to attack the white box implementation. This even holds for the values α_(i)(x) and β_(i)(y), which are generally merged with other functions as the S-box is typically merged with other functions. This will be described in greater detail below with respect to the various embodiments.

The choice of Γ∈W depends on a computation based upon the input bytes. Thus, it is not fixed, and as a result the Billet attack cannot be used to attack the white-box implementation.

Biryukov describes an algorithm to compute the set of affine self-equivalences of an S-box as described below.

In order to describe embodiments of the invention, a basic description of a table-based white-box AES implementation is given first. For a more detailed description of a method for implementing a table-based white-box AES see Chow 1. Chow 1 illustrates a specific implementation that breaks up certain functions using tables of specified sizes. It is well understood that various other divisions of the tables may be made resulting in different functions for the look-up tables and different sizes. Further, while the embodiments of the invention described below use a table-based white-box implementation of AES, other ciphers and cryptographic functions may be implemented according to the embodiments described. Also, other types of white-box implementations may be used instead of the table-based implementation, for example, a finite-state implementation.

The description of the table-based white-box AES is split into two steps. In the first step, a round of AES is described as a network of lookup tables. In the second step, the tables are obfuscated by encoding their input and output.

Step 1: Implementing AES as a Network of Lookup Tables.

AES operates on data blocks of 16 bytes. These are typically described as a 4×4 byte matrix, called the state, including bytes x_(1,1), x_(1,2), x_(1,3), . . . x_(4,4). A round of AES as described above with respect to FIG. 1 includes the following operations: AddRoundKey 110, SubBytes 120, ShiftRows 130, and MixColumns 140. The first two operations, AddRoundKey and SubBytes, can be merged into a single T-box operation. That is, we can define a byte-to-byte function T_(i,j) for input byte x_(i,j) as T_(i,j)(x_(i,j))=S(x_(i,j)⊕k_(i,j)) where k_(i,j) is a single byte of a 16 byte round key based upon the AES key. Let y_(i,j) be the output of T_(i,j). The ShiftRows operation is just an index-renumbering of the output bytes y_(i,j). For ease of presentation, this operation is omitted in this description, but may be incorporated into the look-up table implementing T_(i,j) or implemented as a separate manipulation of the state matrix. In the MixColumns step, an output byte z_(l,j) of the round is computed from the 4 output bytes y_(1,j), y_(2,j), y_(3,j), and y_(4,j) via the algebraic expression z_(l,j)=MC_(l,1)·y_(1,j)⊕MC_(l,2)·y_(2,j)⊕MC_(l,3)·y_(3,j)⊕MC_(l,4)·y_(4,j) in GF(2⁸) for some constants MC_(l,r).

Now define a lookup table for each byte-to-byte function Q_(i,j,l)(x_(i,j))=MC_(l,i)·T_(i,j)(x_(i,j)) with i, j, l=1, 2, . . . , 4. Then any output byte z_(l,j) may be computed by XORing the results of these lookup tables, i.e., z_(l,j)=Q_(1,j,l)(x_(1,j))⊕Q_(2,j,l)(x_(2,j)) ⊕Q_(3,j,l)(x_(3,j)) ⊕Q_(4,j,l)(x_(4,j)). Note that the index i, j, l of a Q-box can be interpreted as “the contribution of input byte i, j of a round to output byte l, j of the round”. The XOR may be implemented to operate on each of two nibbles (i.e., 4-bit values) as a lookup table to reduce the size of the XOR tables. Accordingly, the Q-box may be implemented to produce output nibbles so that the size of the tables is reduced. Therefore, the computation of each output byte z_(l,j) of an AES-round has been described as a network of lookup tables. The network of lookup tables to compute a single output nibble of byte z_(2,3) is shown in FIG. 3.
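The Q-box construction can be sketched in Python as follows. This is a minimal, non-obfuscated illustration under stated assumptions: indices are 0-based, ShiftRows is omitted as in the description above, the round key is an arbitrary 4×4 byte matrix, and the names `make_q_boxes` and `round_output_byte` are hypothetical.

```python
import random

def gf_mul(a, b):
    """Multiply two bytes in GF(2^8) modulo the AES polynomial 0x11B."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        a <<= 1
        if a & 0x100:
            a ^= 0x11B
        b >>= 1
    return r

INV = [0] + [next(b for b in range(1, 256) if gf_mul(a, b) == 1)
             for a in range(1, 256)]

def rotl8(x, k):
    return ((x << k) | (x >> (8 - k))) & 0xFF

def A_aes(x):
    """AES affine map: A_AES(x) = L_AES(x) xor 0x63."""
    return x ^ rotl8(x, 1) ^ rotl8(x, 2) ^ rotl8(x, 3) ^ rotl8(x, 4) ^ 0x63

SBOX = [A_aes(INV[x]) for x in range(256)]

# MixColumns coefficients MC[l][i] (row l of the output, row i of the input).
MC = [[2, 3, 1, 1], [1, 2, 3, 1], [1, 1, 2, 3], [3, 1, 1, 2]]

def make_q_boxes(round_key):
    """Q[i][j][l][x] = MC[l][i] * S(x xor k[i][j]): T-box merged with MixColumns."""
    return [[[[gf_mul(MC[l][i], SBOX[x ^ round_key[i][j]]) for x in range(256)]
              for l in range(4)] for j in range(4)] for i in range(4)]

def round_output_byte(Q, state, l, j):
    """z[l][j] as the XOR of the four Q-box lookups for column j."""
    z = 0
    for i in range(4):
        z ^= Q[i][j][l][state[i][j]]
    return z

rng = random.Random(0)
key = [[rng.randrange(256) for _ in range(4)] for _ in range(4)]
state = [[rng.randrange(256) for _ in range(4)] for _ in range(4)]
Q = make_q_boxes(key)
print(hex(round_output_byte(Q, state, 1, 2)))  # byte z_(2,3) in 1-based notation
```

In this form the round key is trivially recoverable from the tables, which is the reason for the obfuscation step described next.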

FIG. 3 illustrates the computation of one output nibble by means of a network of look-up tables. The superscript index (1) in the Q-boxes indicates that the tables only provide the first nibble of the output of the Q-box. A set of input bytes x_(1,3), x_(2,3), x_(3,3), and x_(4,3) in the input state 310 are input into the Q-boxes 320, 322, 324, 326. The outputs of lookup tables 320 and 322 are fed into the XOR 330, and the outputs of lookup tables 324 and 326 are fed into the XOR 332. The outputs of XORs 330 and 332 are fed into XOR 334. The output of XOR 334 is the first nibble of the output z_(2,3) of output state 340. The second nibble of the output z_(2,3) of output state 340 may be calculated in the same way using additional Q-boxes along with a similar XOR network. Further, additional sets of tables may be implemented to completely convert the input state 310 into the output state 340 by receiving a column of bytes from the input state and converting them into the output of the corresponding column of the output state.

Step 2: Obfuscating the Tables and the Intermediate Values

In the implementation depicted in FIG. 3, the key may easily be extracted from the Q-boxes. Just applying the inverse MixColumns multiplication and the inverse S-box to the output reveals the plain AddRoundKey operation. To prevent this, the input and outputs of all lookup tables are encoded with arbitrary bijective functions. This is described in Chow 1. This means that a lookup table is merged with an encoding function that encodes the output and with a decoding function that decodes the input. The encodings are chosen such that the output encoding of one table matches the input encoding assumed in the next tables. A portion of the implementation of FIG. 3 is depicted in FIG. 4 for the first round. In this example, the input to the round is not encoded in order to be compliant with AES, but the output of the round is encoded. The output encoding is handled in the next round. That is, unlike the first round, the second round (and the later rounds) assumes that the input is encoded. Alternatively, the first round may receive an encoded input. This input encoding must then be applied elsewhere in the software program containing the white-box implementation. Similarly, the last round may or may not include an output encoding depending on whether the output is to be AES compliant. Note that in the white-box implementation obtained, both the lookup tables and the intermediate values are obfuscated.

FIG. 4 illustrates a portion of the table network of FIG. 3 obfuscated by encoding the inputs and outputs. The lookup tables 420, 422, 424, 426 correspond to lookup tables 320, 322, 324, 326 of FIG. 3. The outputs of lookup tables 420, 422, 424, 426 are encoded by functions f₁, f₂, f₃, f₄ respectively. XOR 430 corresponds to XOR 330. The inputs of XOR 430 are decoded using f₁⁻¹ and f₂⁻¹. The output of XOR 430 is then encoded by function f₅. In a similar manner XORs 432, 434 have input decodings and output encodings as illustrated. The output z_(2,3) is encoded using f₇. While the encodings on the inputs and outputs of the lookup tables provide some protection against simple attacks, an attacker may use the techniques described in Billet to attack this implementation of a white-box AES at the boundaries between the rounds of the AES implementation.
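The merging of input decodings and an output encoding into a single table can be sketched for a nibble XOR such as XOR 430. The following Python fragment is illustrative only: the encodings are random permutations of the 16 nibble values generated with a seeded generator, and the names `random_bijection`, `enc_xor`, and `encoded_xor` are hypothetical.

```python
import random

rng = random.Random(1)

def random_bijection(size):
    """Return (enc, dec): a random permutation of range(size) and its inverse."""
    enc = list(range(size))
    rng.shuffle(enc)
    dec = [0] * size
    for plain, coded in enumerate(enc):
        dec[coded] = plain
    return enc, dec

# f1 and f2 encode the two XOR inputs; f5 encodes the XOR output.
f1, f1_inv = random_bijection(16)
f2, f2_inv = random_bijection(16)
f5, f5_inv = random_bijection(16)

# One merged table: decode both inputs, XOR, encode the result.
enc_xor = [[f5[f1_inv[a] ^ f2_inv[b]] for b in range(16)] for a in range(16)]

def encoded_xor(a_enc, b_enc):
    """Look up the encoded XOR; plain values never appear at run time."""
    return enc_xor[a_enc][b_enc]

a, b = 0x9, 0x4
print(f5_inv[encoded_xor(f1[a], f2[b])] == (a ^ b))
```

Only the merged table `enc_xor` would be shipped; the separate encoding and decoding lists exist here solely to build and check it.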

Embodiments are now described that prevent the attack as described by Billet. In the description of these embodiments a non-obfuscated table network is described in order to facilitate the description without added complexity. The embodiments described may be turned into an obfuscated table network in the same way as described in FIG. 4. Furthermore, the description focuses on how to implement the first round to provide the split input of the second round. One of skill in the art may obtain the complete white-box implementation by extending these ideas to all rounds.

The description of the table lookup based white-box implementation described above was for the encryption operation of AES. It is noted that the above description is easily adapted for the decryption operation by using the inverse of the SubBytes, ShiftRows, and MixColumns operations (invSubBytes, invShiftRows, and invMixColumns). Accordingly, it is assumed that the description above can be used for either the encryption or decryption operation of AES as needed in the embodiments below.

The embodiments described below include the following aspects as previously described above. Let V be the set of affine self-equivalences of an S-box, i.e., V={(α,β)|S=β∘S∘α⁻¹ with α, β affine}.

Next, let the set W contain n pairs from V as follows, where h is an affine bijective function: W ⊂{Γ=(Γ₁,Γ₂, . . . ,Γ_(n))∈V ^(n)|Γ_(i)=(α_(i),β_(i)) with ∀_(x)⊕_(i)β_(i)(x)=h(x)}. The function h(x) may be a fixed affine function, such as the identity function, but can also be a by-the-computation-selected affine function.

For an input byte x, n copies of the S-box are fed with the inputs α₁(x), α₂(x), . . . , α_(n)(x) for any Γ∈W. This results in the S-box outputs β₁(y), β₂(y), . . . , β_(n)(y). XORing the S-box outputs β₁(y), β₂(y), . . . , β_(n)(y) results in the correct S-box output y=S(x) up to an affine mapping h. Generally, the final XOR to obtain y is not calculated explicitly in order to minimize the opportunity to attack the white box implementation. This even holds for the values α_(i)(x) and β_(i)(y), which are generally merged with other functions as the S-box is typically merged with other functions.

The choice of Γ∈W depends on a computation based upon the input bytes as further described below. It is not fixed, and as a result the Billet attack cannot be used.

So, first the set V of affine self-equivalences of the AES S-box is defined. The AES S-box can be written as: S(x)=A_(AES)(x⁻¹), where the inverse is taken in GF(2⁸) and A_(AES) is a fixed affine mapping. As shown by Biryukov, the set of self-equivalences is given by $V=\left\{\left(\sqrt[2^{i}]{c^{-1}\cdot x},\ A_{AES}\left(\sqrt[2^{i}]{c\cdot A_{AES}^{-1}(x)}\right)\right)\ \middle|\ 0\le i<8\ \wedge\ c\in GF(2^{8})\setminus\{0\}\right\}$. We note that for any (α, β)∈V the function α is linear.
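The self-equivalence property can be checked numerically with a short Python sketch. It assumes that the 2^i-th root in GF(2⁸) is computed by squaring 8−i times (since x^(2⁸)=x), and it checks the relation S(α(x))=β(S(x)) for a sample of pairs; the helper names `root`, `self_equivalence`, and `is_self_equivalence` are hypothetical.

```python
def gf_mul(a, b):
    """Multiply two bytes in GF(2^8) modulo the AES polynomial 0x11B."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        a <<= 1
        if a & 0x100:
            a ^= 0x11B
        b >>= 1
    return r

INV = [0] + [next(b for b in range(1, 256) if gf_mul(a, b) == 1)
             for a in range(1, 256)]

def rotl8(x, k):
    return ((x << k) | (x >> (8 - k))) & 0xFF

def A_aes(x):
    """AES affine map: A_AES(x) = L_AES(x) xor 0x63."""
    return x ^ rotl8(x, 1) ^ rotl8(x, 2) ^ rotl8(x, 3) ^ rotl8(x, 4) ^ 0x63

A_INV = [0] * 256
for x in range(256):
    A_INV[A_aes(x)] = x

SBOX = [A_aes(INV[x]) for x in range(256)]

def root(x, i):
    """2^i-th root in GF(2^8): square (8 - i) times, because x^(2^8) = x."""
    for _ in range((8 - i) % 8):
        x = gf_mul(x, x)
    return x

def self_equivalence(c, i):
    """Return the pair (alpha, beta) for nonzero c and 0 <= i < 8."""
    def alpha(x):
        return root(gf_mul(INV[c], x), i)
    def beta(y):
        return A_aes(root(gf_mul(c, A_INV[y]), i))
    return alpha, beta

def is_self_equivalence(alpha, beta):
    """Check S(alpha(x)) == beta(S(x)) for every byte x."""
    return all(SBOX[alpha(x)] == beta(SBOX[x]) for x in range(256))

alpha, beta = self_equivalence(0x53, 3)
print(is_self_equivalence(alpha, beta))
```

The sketch only samples individual pairs (c, i); an exhaustive check over all 8·255 pairs follows the same pattern but takes noticeably longer in pure Python.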

Further, for the function h the constant addition of A_(AES) may be used, i.e., h(x)=x⊕C_(AES) with A_(AES)(x)=L_(AES)(x)⊕C_(AES) for a linear mapping L_(AES). Now let n=4 and define the set W as $W=\{\Gamma\in V^{n}\mid\Gamma_{i}=(\alpha_{i},\beta_{i})\ \text{with}\ \beta_{i}:x\mapsto A_{AES}(c_{i}\cdot A_{AES}^{-1}(x))\ \wedge\ c_{n}=1\oplus(\oplus_{j=1}^{n-1}c_{j})\}$. This formulation complies with the more abstract definition of W. That is, for any Γ=((α₁,β₁), (α₂,β₂), . . . , (α_(n),β_(n)))∈W and any byte x, ⊕_(i)β_(i)(x)=h(x), as can be verified as follows:

$$\begin{aligned}
\bigoplus_{i=1}^{4}A_{AES}\left(c_{i}\cdot A_{AES}^{-1}(x)\right)
&=L_{AES}\left(\bigoplus_{i=1}^{4}c_{i}\cdot A_{AES}^{-1}(x)\right) &&\{A_{AES}\ \text{affine, constant canceled out}\}\\
&=L_{AES}\left(\left(\bigoplus_{i=1}^{4}c_{i}\right)\cdot A_{AES}^{-1}(x)\right) &&\{\text{distributive law}\}\\
&=L_{AES}\left(A_{AES}^{-1}(x)\right) &&\{c_{4}=1\oplus(\oplus_{j=1}^{3}c_{j})\}\\
&=h(x)
\end{aligned}$$
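The identity ⊕_(i)β_(i)(x)=h(x) for n=4 can also be confirmed numerically. The following Python sketch picks random nonzero coefficients under the constraint c₄=1⊕c₁⊕c₂⊕c₃ and compares the XOR of the four β_(i) with x⊕C_(AES); the names `beta`, `h`, and `pick_coefficients` and the seeded generator are illustrative assumptions.

```python
import random

def gf_mul(a, b):
    """Multiply two bytes in GF(2^8) modulo the AES polynomial 0x11B."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        a <<= 1
        if a & 0x100:
            a ^= 0x11B
        b >>= 1
    return r

def rotl8(x, k):
    return ((x << k) | (x >> (8 - k))) & 0xFF

C_AES = 0x63

def A_aes(x):
    """AES affine map: A_AES(x) = L_AES(x) xor C_AES."""
    return x ^ rotl8(x, 1) ^ rotl8(x, 2) ^ rotl8(x, 3) ^ rotl8(x, 4) ^ C_AES

A_INV = [0] * 256
for x in range(256):
    A_INV[A_aes(x)] = x

def beta(c, x):
    """beta_i: x -> A_AES(c_i * A_AES^-1(x))."""
    return A_aes(gf_mul(c, A_INV[x]))

def h(x):
    """Constant addition of A_AES."""
    return x ^ C_AES

def pick_coefficients(rng):
    """Pick nonzero c1, c2, c3 and set c4 = 1 xor c1 xor c2 xor c3 (also nonzero)."""
    while True:
        cs = [rng.randrange(1, 256) for _ in range(3)]
        c4 = 1 ^ cs[0] ^ cs[1] ^ cs[2]
        if c4 != 0:
            return cs + [c4]

rng = random.Random(2)
cs = pick_coefficients(rng)
ok = all(beta(cs[0], x) ^ beta(cs[1], x) ^ beta(cs[2], x) ^ beta(cs[3], x) == h(x)
         for x in range(256))
print(cs, ok)
```

The four constants C_(AES) cancel because n=4 is even, which mirrors the first step of the derivation above.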

The computation of output byte z_(2,3) of the first round of AES is now shown according to an exemplary embodiment of the invention for n=4 and h(x)=x⊕C_(AES), i.e., the constant addition of A_(AES). Alternatively, other values of n and h(x) may be chosen. The computation of other bytes runs similarly. This specifies the white-box implementation of the first round without the obfuscation step. The input to the first round is the input state x to be encrypted or decrypted. Bytes of x are input to the Q-boxes to produce nibbles of the y values. The various y values output by the Q-boxes may then be combined into output bytes z that are part of the output state for the round. The y values may be used to select the input splitting functions α that will be applied to the output bytes of z because the output bytes z of the first round become the input bytes of the second round. As described below, these splitting functions may be incorporated while calculating the final values of the bytes z. Then in the second round, the output will be split using another set of splitting functions α selected in a similar manner as in the first round to produce the split input to the third round. This continues until the next to last round. This splitting of the input by the set of splitting functions α protects the interface between rounds of the AES process from the Billet attack.

Further, to ease the presentation, the key-addition of the second round is ignored for the time being. This means that the input of an AES S-box of the second round is given by an output byte of the first round. Later on, it will be described how to cope with the key-addition.

The choice of Γ∈W depends on a computation based upon the input bytes, and because it is not fixed, the Billet attack cannot be used against z_(2,3). Because n=4, the first round now has to compute the 4-tuple of bytes α_(2,3,1)(z_(2,3)), α_(2,3,2)(z_(2,3)), α_(2,3,3)(z_(2,3)), α_(2,3,4)(z_(2,3)) (where the output byte z_(2,3) is used as an example), where the affine functions α_(2,3,k) are taken from W and are not fixed, i.e., can be different for different runs of the white-box implementation. These bytes α_(2,3,1)(z_(2,3)), α_(2,3,2)(z_(2,3)), α_(2,3,3)(z_(2,3)), α_(2,3,4)(z_(2,3)) become the input to the second round. Being an element from W means that α_(2,3,k): x ↦ c_(k)⁻¹·x for some c_(k)∈GF(2⁸)\{0} with c₄=1⊕c₁⊕c₂⊕c₃ because h(x)=x⊕C_(AES) in this exemplary embodiment.

There are 4 input bytes to the first round contributing to the value of z_(2,3). On these four input bytes, first a Q-box operation is performed as shown in FIG. 3. Let y_(i,3,2) be the ith output byte of these four Q-box operations. Now let c_(i) be obtained from the first nibble of y_(i,3,2) via c_(i)=(y_(i,3,2) ⁽¹⁾, 1,0,0,0)∈GF(2⁸)\{0}, for i≠4. For i=4, let c₄=1⊕c₁⊕c₂⊕c₃. Hence, c₄=1⊕(y_(1,3,2) ⁽¹⁾⊕y_(2,3,2) ⁽¹⁾⊕y_(3,3,2) ⁽¹⁾,1,0,0,0)∈GF(2⁸)\{0}. This leads to the α's being dependent upon the input state and hence the α's will vary as the input state varies. In the network of FIG. 3, we insert after the Q-boxes a computation of the values c₁ ⁻¹·y_(j,3,2), c₂ ⁻¹·y_(j,3,2), c₃ ⁻¹·y_(j,3,2), c₄ ⁻¹·y_(j,3,2). FIG. 5 shows how c_(i) ⁻¹·y_(j,3,2) may be computed for i=j≠4. The input x_(1,3) is input to the look-up tables 520 and 522. The look-up tables 520 and 522 operate on bytes to produce nibbles of the output y_(1,3,2). The nibbles of y_(1,3,2) are input to a table 530 that calculates c₁ ⁻¹·y_(1,3,2) by combining the nibbles and multiplying by c₁ ⁻¹, where c₁=(y_(1,3,2) ⁽¹⁾, 1,0,0,0).

FIG. 6 shows how c_(i) ⁻¹·y_(j,3,2) may be computed for i≠j and i≠4. In FIG. 6, i=2 and j=1. As in FIG. 5 the input x_(1,3) is input to the look-up tables 520 and 522. Further, the input x_(2,3) is input to the look-up table 524 because c₂=(y_(2,3,2) ⁽¹⁾, 1,0,0,0). The nibbles of y_(1,3,2) are input to tables 630, 632 and then the XOR 640 to calculate the first nibble of c₂ ⁻¹·y_(1,3,2). The computation of the second nibble runs similarly.

What remains is the case where i=4, where the value c₄ is computed. FIG. 7 illustrates a XOR network used to calculate c₄. The inputs x_(1,3), x_(2,3), x_(3,3) are input to the look-up tables 520, 524, 528 respectively to produce y_(1,3,2) ⁽¹⁾, y_(2,3,2) ⁽¹⁾, y_(3,3,2) ⁽¹⁾, which are then fed into the XORs 730, 732 as shown to calculate ĉ₄ which has a one-to-one correspondence to c₄ via c₄=(ĉ₄, 1,0,0,0). This value of ĉ₄ is then used to calculate c₄ ⁻¹·y_(1,3,2) in a similar way as we calculated c₂ ⁻¹·y_(1,3,2) in FIG. 6.

The value c₁ ⁻¹·z_(2,3) may be calculated from c₁ ⁻¹·y_(1,3,2), c₁ ⁻¹·y_(2,3,2), c₁ ⁻¹·y_(3,3,2), c₁ ⁻¹·y_(4,3,2) in the same way as z_(2,3) is calculated from the output of the Q-boxes as shown in FIG. 3 by simply XORing the four bytes via the XOR network. Similarly, the output bytes c₂ ⁻¹·z_(2,3), c₃ ⁻¹·z_(2,3), c₄ ⁻¹·z_(2,3) may be computed. Further, four split versions of all of the output bytes z_(i,j) may likewise be produced. These split output bytes become the split inputs into the second round. Further, the lookup tables and network may be further obfuscated as shown in FIG. 4.
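The splitting of an output byte can be sketched in Python without the table network. The sketch assumes that the "first nibble" is the high nibble of the byte and that (nibble, 1,0,0,0) denotes the nibble followed by the bits 1000; both are interpretations rather than facts stated above. The names `coeff_from_nibble` and `split_output` are hypothetical.

```python
def gf_mul(a, b):
    """Multiply two bytes in GF(2^8) modulo the AES polynomial 0x11B."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        a <<= 1
        if a & 0x100:
            a ^= 0x11B
        b >>= 1
    return r

INV = [0] + [next(b for b in range(1, 256) if gf_mul(a, b) == 1)
             for a in range(1, 256)]

def rotl8(x, k):
    return ((x << k) | (x >> (8 - k))) & 0xFF

def A_aes(x):
    return x ^ rotl8(x, 1) ^ rotl8(x, 2) ^ rotl8(x, 3) ^ rotl8(x, 4) ^ 0x63

SBOX = [A_aes(INV[x]) for x in range(256)]

def coeff_from_nibble(nib):
    """Assumed layout of (nib, 1,0,0,0): nibble in the high half, 1000 in the low half."""
    return (nib << 4) | 0x8

def split_output(ys):
    """From four Q-box output bytes, return (c, shares) with shares[k] = c_k^-1 * z."""
    c = [coeff_from_nibble(y >> 4) for y in ys[:3]]
    c.append(1 ^ c[0] ^ c[1] ^ c[2])          # c4 = 1 xor c1 xor c2 xor c3
    shares = []
    for ck in c:
        share = 0
        for y in ys:                          # XOR network over c_k^-1 * y_j
            share ^= gf_mul(INV[ck], y)
        shares.append(share)
    return c, shares

ys = [0x3A, 0xC1, 0x7E, 0x05]                 # example Q-box outputs
c, shares = split_output(ys)
z = ys[0] ^ ys[1] ^ ys[2] ^ ys[3]

# Feeding the four shares to S-box copies and XORing yields h(S(z)).
recombined = 0
for s in shares:
    recombined ^= SBOX[s]
print(recombined == SBOX[z] ^ 0x63)
```

The explicit recombination at the end is for checking only; as noted above, a white-box implementation would generally avoid computing this XOR explicitly.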

In the above discussion, the key-addition operation of the second round was ignored. This simplified the description, for in that case, the output of the round corresponds with the input of the S-box. Any function f preceding and merged with the S-box operation may be accounted for as follows. According to the embodiments of the invention, the S-box should have as input, for example, α_(2,3,k)(x). Because the S-box is preceded with the function f, the input becomes x=ƒ(z_(2,3)). To guarantee that α_(2,3,k)∘ƒ(z_(2,3)) is input to the S-box, ƒ⁻¹∘α_(2,3,k)∘ƒ(z_(2,3)) is computed instead of α_(2,3,k)(z_(2,3)) in round 1, where in this case the function f carries out the key addition function.
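The handling of the key addition can be illustrated with a small Python sketch in which ƒ(z)=z⊕k, so that ƒ⁻¹=ƒ. It assumes α: x ↦ c⁻¹·x and β: y ↦ A_(AES)(c·A_(AES)⁻¹(y)) as in the embodiment above; the names `round1_output` and `t_box` and the example values are hypothetical.

```python
def gf_mul(a, b):
    """Multiply two bytes in GF(2^8) modulo the AES polynomial 0x11B."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        a <<= 1
        if a & 0x100:
            a ^= 0x11B
        b >>= 1
    return r

INV = [0] + [next(b for b in range(1, 256) if gf_mul(a, b) == 1)
             for a in range(1, 256)]

def rotl8(x, k):
    return ((x << k) | (x >> (8 - k))) & 0xFF

def A_aes(x):
    return x ^ rotl8(x, 1) ^ rotl8(x, 2) ^ rotl8(x, 3) ^ rotl8(x, 4) ^ 0x63

A_INV = [0] * 256
for x in range(256):
    A_INV[A_aes(x)] = x

SBOX = [A_aes(INV[x]) for x in range(256)]

def alpha(c, x):
    return gf_mul(INV[c], x)

def beta(c, y):
    return A_aes(gf_mul(c, A_INV[y]))

def round1_output(c, z, k):
    """f^-1 . alpha . f applied to z, where f(z) = z xor k is the key addition."""
    return alpha(c, z ^ k) ^ k

def t_box(k, w):
    """Round-2 T-box: key addition merged with the S-box."""
    return SBOX[w ^ k]

c, k, z = 0x58, 0xA3, 0x4F  # example coefficient, round-key byte, output byte
print(t_box(k, round1_output(c, z, k)) == beta(c, t_box(k, z)))
```

The check shows that conjugating α with the key addition makes the S-box inside the T-box see α(ƒ(z)), so the T-box output is β applied to the unsplit T-box output.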

Next, how to compensate for the split S-box in the second round (and subsequent rounds) is described. In the second round, again groups of four bytes output by the Q-box have to be XORed. Furthermore, because the input is split, the original S-box output is obtained by XORing the result of the n S-box copies. Hence, the second round only differs from the first round in that the number of input bytes is increased by a factor n, i.e., 16n input bytes are received. This means that, to compute a single output byte for the second round, four Q-box output-bytes are not added, but rather 4n Q-box output bytes. Here these 4n output bytes can be divided into four n-tuples, where each n-tuple represents the outcome of an original S-box via the relationship h(y)=⊕_(i)β_(i)(y). As described above, the n values β₁(y), β₂(y), . . . , β_(n)(y) that sum up to h(y) are generally not XORed explicitly, for the following reason.

Suppose that the n values β₁(y), β₂(y), . . . , β_(n)(y) are XORed explicitly via for instance lookup tables. Instead of mounting the attack on the input of an S-box, an attacker can then mount the attack on this value h(y), which can have a fixed bijective relation to the input of the S-box. This is no longer possible if we XOR the 4n values in a different order and/or apply the splitting functions α_(i) for the next round while XORing these values, in which case the XORing of β₁(y), β₂(y), . . . , β_(n)(y) is no longer done explicitly.

It is also noted that β_(j) (y) is generally not calculated explicitly because the S-box is typically merged with other functions. This also holds in this embodiment where the S-box is merged with the MixColumns operation succeeding it. Hence, instead of β_(j)(y) the value MC·β_(j)(y) is computed for a MixColumns coefficient MC.

A feature of the embodiments described is that the choice of Γ∈W depends on a computation based upon the input and that it is not fixed. In the embodiment described above n=4 and the choice of Γ∈W depends upon only input bytes. It is possible to use a computation using more input bytes in order to further strengthen the resistance to the Billet attack. This will be described further below where n=2 and the choice of Γ∈W depends upon four of the input bytes.

A second exemplary embodiment of the invention to implement the computation of output byte z_(2,3) of the first round of AES is now described. The computation of other bytes runs similarly. In the first round the 2-tuple of bytes α_(2,3,1)(z_(2,3)), α_(2,3,2)(z_(2,3)) is computed, where the affine functions α_(2,3,k) are taken from W and are not fixed. Being an element from W means that α_(2,3,k): x ↦ c_(k)⁻¹·x for some c_(k)∈GF(2⁸)\{0} with c₂=1⊕c₁.

There are four input bytes to the first round contributing to the value of z_(2,3). First, a Q-box operation is performed on these four input bytes as previously described. Let y_(i,3,2) be the ith output byte of these four Q-box operations. We now let c₁ be the product in GF(2⁸) of the four bytes derived from the first nibbles of the y_(i,3,2), unless this product equals 1. In that case it is set to 2. That is,

$c_{1} = \left\{ \begin{matrix} {\prod\limits_{i = 1}^{4}\;\left( {y_{i,3,2}^{(1)},1,0,0,0} \right)} & {{{if}\mspace{14mu}{value}} \neq 1} \\ 2 & {otherwise} \end{matrix} \right.$ We note that (y_(i,3,2) ⁽¹⁾,1,0,0,0) indicates the byte obtained by adding a 1 and 3 zeros to nibble y_(i,3,2) ⁽¹⁾. Furthermore, c₂=1⊕c₁. Note that c₁ and c₂ are both not equal to 0, which is a necessary condition. The values of c₁, c₂ are then used in the same manner as described above. This embodiment may be expanded to other values of n. Further, other calculations to determine the values of c_(i) may be used as well in order to make the splitting of the n values of x_(i) vary depending upon the input. Such calculations are incorporated into the various functions implemented by the lookup tables in a way so as to obscure the function being performed.
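The selection of c₁ and c₂ in this second embodiment can be sketched in Python. As before, the layout of (nibble, 1,0,0,0) as a byte with the nibble in the high half is an assumption, and the function names `coeff_from_nibble` and `second_embodiment_coeffs` are hypothetical.

```python
def gf_mul(a, b):
    """Multiply two bytes in GF(2^8) modulo the AES polynomial 0x11B."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        a <<= 1
        if a & 0x100:
            a ^= 0x11B
        b >>= 1
    return r

def coeff_from_nibble(nib):
    """Assumed layout of (nib, 1,0,0,0): nibble in the high half, 1000 in the low half."""
    return (nib << 4) | 0x8

def second_embodiment_coeffs(ys):
    """Return (c1, c2) derived from the first nibbles of four Q-box output bytes."""
    prod = 1
    for y in ys:
        prod = gf_mul(prod, coeff_from_nibble(y >> 4))
    c1 = prod if prod != 1 else 2   # avoid c1 = 1, which would give c2 = 0
    c2 = 1 ^ c1
    return c1, c2

c1, c2 = second_embodiment_coeffs([0x3A, 0xC1, 0x7E, 0x05])
print(hex(c1), hex(c2))
```

Because every factor has its low nibble set to 1000 it is nonzero, so the product is nonzero, and excluding the value 1 keeps c₂ nonzero as well.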

Now remapping constant points from V₁ with V=V₁×V₂ to other points is applied to the embodiment described above. It is easily verified that V₁ in the embodiment described above using AES has exactly one constant point, viz. 0. Hence, U={0}. Remapping this constant point includes the following steps: (1) identifying whether an output byte equals 0, (2) if this is the case, then the value has to be remapped to some other value, and (3) then this operation has to be undone or compensated for in the next round.

Starting with the first step, an output byte z_(k,j) equals 0 if and only if XORing the four Q-box outputs y_(1,j,k), y_(2,j,k), y_(3,j,k), and y_(4,j,k) gives 0. FIG. 8 illustrates determining if an output z_(k,j) equals 0, which may be implemented by a small table network. The first four Q-boxes 520-523 determine the first nibble of z_(k,j), while the second four Q-boxes 524-527 determine the second nibble of z_(k,j). The tables 830 and 832 compute whether the first and second nibble XOR to 0, and the table 834 combines the results from 830 and 832 to determine whether z_(k,j) equals 0.

Now, this result leads to the need to remap 0 to a different value. Because z_(k,j)=y_(1,j,k)⊕y_(2,j,k)⊕y_(3,j,k)⊕y_(4,j,k) this can be done by XORing y_(1,j,k) with any non-zero value C. This may be done by changing the computation of c_(i) ⁻¹·y_(1,3,2). This can be shown for i=4. Other cases for different values of i can be done similarly. For i=4, the computation was depicted in FIG. 7. This computation changes to the one depicted in FIG. 9. FIG. 9 illustrates remapping the 0 output to a different value. Note that this is only done for one of the two nibbles of y_(1,3,2). Otherwise, the effects cancel out.

To see the effect of the changes illustrated in FIG. 9, observe the following. Suppose that z_(2,3)=0. Without the changes in FIG. 9, this value would always be mapped to 0 in an execution of the white-box implementation (disregarding the fixed encodings as applied in FIG. 4). However, because z_(2,3)=0 is remapped to z_(2,3)=C before the multiplication with c_(i), the value 0 can be mapped to multiple values, although the remapping is done to a fixed value C.

As a final step, the effect of the addition of C in the case z_(k,j)=0 has to be cancelled out. First, consider what error results if b_(k,j)=1, that is, if z_(k,j)=0, because of the modification. For each output byte u of the next round that depends on z_(k,j), the contribution of z_(k,j) is given by MC·S(z_(k,j)) for some MixColumns coefficient MC, where for simplicity the key addition operation is ignored. This implies that the error resulting if z_(k,j)=0 is MC·δ with δ=S(0)⊕S(C). This error may be compensated for as follows. With respect to z_(k,j), the computation of u in the next round starts with n Q-boxes that have as input α_(i)({tilde over (z)}_(k,j)) for n affine functions α_(i). Here the tilde in the notation indicates that the value 0 is mapped to C. Originally, the n outputs of those Q-boxes are XORed to obtain MC·h(S(z_(k,j))), with h being the constant addition of A_(AES). However, now this has become MC·(h(S(z_(k,j))⊕δ)) if b_(k,j)=1. To compensate for this, exactly one of these Q-boxes is chosen, say the first one, and MC·δ is added to its output depending on the value of b_(k,j). This is depicted in FIG. 10, which illustrates the compensation for remapping the 0 value in the next round.
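The compensation can be checked numerically with the following Python sketch. It is a simplified model rather than the table network of FIG. 10: the key addition and all encodings are ignored, S is taken to be the standard AES S-box computed from its definition, and the value of C and the function names (gf_mul, gf_inv, rotl8, sbox, next_round_contribution) are hypothetical.

```python
def gf_mul(a, b):
    """Multiply a and b in GF(2^8) modulo the AES polynomial x^8+x^4+x^3+x+1."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        a <<= 1
        if a & 0x100:
            a ^= 0x11B
        b >>= 1
    return r


def gf_inv(a):
    """Multiplicative inverse in GF(2^8) by exhaustive search; 0 maps to 0."""
    if a == 0:
        return 0
    return next(x for x in range(1, 256) if gf_mul(a, x) == 1)


def rotl8(x, n):
    """Rotate an 8-bit value left by n bits."""
    return ((x << n) | (x >> (8 - n))) & 0xFF


def sbox(x):
    """AES S-box: field inversion followed by the affine transformation."""
    inv = gf_inv(x)
    return inv ^ rotl8(inv, 1) ^ rotl8(inv, 2) ^ rotl8(inv, 3) ^ rotl8(inv, 4) ^ 0x63


C = 0x01                   # hypothetical non-zero remapping value
DELTA = sbox(0) ^ sbox(C)  # delta = S(0) xor S(C)


def next_round_contribution(z, mc):
    """Contribution MC*S(z) computed from the remapped value and the flag b."""
    b = int(z == 0)
    z_tilde = C if b else z          # remapping done in the previous round
    out = gf_mul(mc, sbox(z_tilde))  # next round only sees z_tilde
    if b:
        out ^= gf_mul(mc, DELTA)     # add MC*delta when a remapping occurred
    return out
```

Because multiplication by MC distributes over XOR, MC·S(C)⊕MC·δ equals MC·S(0), so the compensated result agrees with the unmodified computation for every z. The flag b is needed because the remapped value C is indistinguishable from a value that was equal to C originally.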

While remapping of a 0 value is described, remapping of other values may be accomplished as well. Further, the remapping embodiments described above may be further generalized. Instead of mapping constant points to a non-constant point, it is possible to map “almost”-constant points to other points. To be more precise, for an S-box input x, let its diversification number D(x) be defined as the number of points to which it can be mapped by V₁. Hence, D(x)=|{ƒ(x)|ƒ∈V₁}|. Then, values with a low diversification number may be mapped to values with a higher diversification number. Such logic could be implemented, for example, in the tables 830 and 832 described above. In order to compensate for such mapping, information indicating that a remapping was done may be passed to the next round of the cryptographic function.
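The definition of D(x) can be made concrete with a short Python sketch. The set V1 below is purely illustrative: it is assumed to consist of multiplications by four hypothetical constants in GF(2⁸), and the names diversification_number and low_diversification_points are likewise hypothetical.

```python
def gf_mul(a, b):
    """Multiply a and b in GF(2^8) modulo the AES polynomial x^8+x^4+x^3+x+1."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        a <<= 1
        if a & 0x100:
            a ^= 0x11B
        b >>= 1
    return r


# Hypothetical V1: multiplication by each of four assumed constants.
V1 = [lambda x, c=c: gf_mul(c, x) for c in (0x01, 0x02, 0x03, 0x04)]


def diversification_number(x, functions):
    """D(x) = |{f(x) : f in functions}|."""
    return len({f(x) for f in functions})


def low_diversification_points(functions, threshold):
    """All byte values whose diversification number is below the threshold."""
    return [x for x in range(256)
            if diversification_number(x, functions) < threshold]
```

For this choice of V1, diversification_number(0, V1) is 1 and every non-zero byte has diversification number 4, so low_diversification_points(V1, 2) returns only the constant point 0. Other choices of V1 could yield points with intermediate diversification numbers, which are the candidates for the generalized remapping.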

The above-described method of remapping may also be applied to the methods and systems described in Michiels. This would involve detecting the output values that need to be remapped, remapping them, and then compensating for the remapping in the next round.

A method according to the embodiments of the invention may be implemented on a computer as a computer implemented method. Executable code for a method according to the invention may be stored on a computer program medium. Examples of computer program media include memory devices, optical storage devices, integrated circuits, servers, online software, etc. Accordingly, a white-box system may include a computer implementing a white-box computer program. Such a system may also include other hardware elements, including storage and a network interface for transmission of data with external systems as well as among elements of the white-box system.

In an embodiment of the invention, the computer program may include computer program code adapted to perform all the steps of a method according to the invention when the computer program is run on a computer. Preferably, the computer program is embodied on a non-transitory computer readable medium.

Further, because white-box cryptography is often very complicated and/or obfuscated, it is tedious for a human to write. It is therefore advantageous to have a method to create the cryptographic system according to the embodiments of the invention in an automated manner.

A method of creating the cryptographic system according to the invention may be implemented on a computer as a computer implemented method, or in dedicated hardware, or in a combination of both. Executable code for a method according to the invention may be stored on a computer program medium. In such a method, the computer program may include computer program code adapted to perform all the steps of the method when the computer program is run on a computer. The computer program is embodied on a non-transitory computer readable medium.

The cryptographic system described herein may be implemented on a user device such as a mobile phone, tablet, computer, set top box, smart TV, etc. A content provider, such as a television network, video stream service, financial institution, music streaming service, etc., may provide software to the user device for receiving encrypted content from the content provider. That software may have the encryption key embedded therein as described above, and may also include binding strings as described above. Then the content provider may send encrypted content to the user device, which may then decrypt the content using the supplied software and use the content.

Any combination of specific software running on a processor to implement the embodiments of the invention constitutes a specific dedicated machine.

As used herein, the term “non-transitory machine-readable storage medium” will be understood to exclude a transitory propagation signal but to include all forms of volatile and non-volatile memory. Further, as used herein, the term “processor” will be understood to encompass a variety of devices such as microprocessors, field-programmable gate arrays (FPGAs), application-specific integrated circuits (ASICs), and other similar processing devices. When software is implemented on the processor, the combination becomes a single specific machine.

It should be appreciated by those skilled in the art that any block diagrams herein represent conceptual views of illustrative circuitry embodying the principles of the invention.

Although the various exemplary embodiments have been described in detail with particular reference to certain exemplary aspects thereof, it should be understood that the invention is capable of other embodiments and its details are capable of modifications in various obvious respects. As is readily apparent to those skilled in the art, variations and modifications can be effected while remaining within the spirit and scope of the invention. Accordingly, the foregoing disclosure, description, and figures are for illustrative purposes only and do not in any way limit the invention, which is defined only by the claims. 

What is claimed is:
 1. A non-transitory machine-readable storage medium encoded with instructions for execution by a keyed cryptographic operation by a cryptographic system mapping an input message to an output message, wherein the cryptographic operation includes at least one round including a non-linear mapping function configured to map input data to output data, wherein the inputs to the non-linear mapping function are encoded using a plurality encoding functions, comprising: instructions for determining that the input data has a diversification number less than a diversification level threshold value, wherein the diversification number is the number of different output values obtained from the plurality of encoding functions for the input data; instructions for remapping the input data having a diversification number less than a diversification level threshold value to a remapped input data, wherein the remapped input data corresponds to another input data having a diversification number greater than or equal to the diversification threshold value, and instructions for inputting the remapped input data into the non-linear mapping function to obtain output data.
 2. The non-transitory machine-readable storage medium of claim 1, further comprising: instructions for splitting, by the cryptographic system, the input data into n split input data, wherein the splitting of the input data varies based upon the value of the input message; and instructions for inputting each split input data into the non-linear mapping function to obtain n split output data, wherein a combination of the n split output data indicates an output data, wherein the output data results when the input data is input to the non-linear mapping function.
 3. The non-transitory machine-readable storage medium of claim 1, further comprising: instructions for encoding the output data according to a selected one of a plurality of encoding schemes, wherein the encoding scheme is selected out of a plurality of encoding schemes based upon selection data which depends upon the input message; instructions for receiving in a next round of the cryptographic operation the encoder output as input; and instructions for compensating for the effect of the encoding according to a selected one of a plurality of recoding schemes based upon the selected recoding scheme out of the plurality of recoding schemes based upon the selection data.
 4. The non-transitory machine-readable storage medium of claim 2, wherein instructions for inputting each split input data into the non-linear mapping function includes instructions for inputting each split input data into a plurality of split mapping functions wherein XORing the plurality of split mapping functions results in the non-linear mapping function.
 5. The non-transitory machine-readable storage medium of claim 1, wherein the instructions for determining and remapping are associated with a current round of the keyed cryptographic operation producing an output of the current round that is the input to a next round of the keyed cryptographic operation and comprising in the next round compensating for the remapping of the output data.
 6. The non-transitory machine-readable storage medium of claim 5, wherein compensating for the remapping of the output data includes receiving an indication of the input value that was remapped.
 7. The non-transitory machine-readable storage medium of claim 5, wherein in the cryptographic operation is an advance encryption standard (AES) operation and wherein compensating for the remapping of the input data is based upon MC, S(0), and S(C), where MC is mix columns function, S(0) is the value of the AES substitution box for input of 0, and S(C) is the value of the AES substitution box for input of C, where the remapped output is set to the value of C.
 8. The non-transitory machine-readable storage medium of claim 1, wherein determining that the input data has a diversification number less than a diversification level threshold value includes determining that the input data corresponds to a constant point associated with the non-linear mapping function and wherein remapping the input data to a remapped input data includes setting the input data to a value C.
 9. The non-transitory machine-readable storage medium of claim 8, wherein in the cryptographic operation is an advance encryption standard (AES) operation and wherein the constant point is 0.
 10. The non-transitory machine-readable storage medium of claim 1, wherein lookup tables implement the keyed cryptographic operation.
 11. The non-transitory machine-readable storage medium of claim 1, wherein finite state machines implement keyed cryptographic operation.
 12. A method of producing a cryptographic implementation of a cryptographic operation mapping an input message to an output message, wherein the cryptographic operation includes at least one round including a non-linear mapping function configured to map input data to output data, wherein the inputs to the non-linear mapping function are encoded using a plurality encoding functions, comprising: producing a cryptographic implementation of the cryptographic operation wherein the cryptographic implementation is configured to: determine that the input data has a diversification number less than a diversification level threshold value, wherein the diversification number is the number of different output values obtained from the plurality of encoding functions for the input data; remap the input data having a diversification number less than a diversification level threshold value to a remapped input data, wherein the remapped input data corresponds to another input data having a diversification number greater than or equal to the diversification threshold value, and input the remapped input data into the non-linear mapping function to obtain output data.
 13. The method of claim 12, wherein the cryptographic implementation is further configured to: split, by the cryptographic system, the input data into n split input data, wherein the splitting of the input data varies based upon the value of the input message; and input each split input data into the non-linear mapping function to obtain n split output data, wherein a combination of the n split output data indicates an output data, wherein the output data results when the input data is input to the non-linear mapping function.
 14. The method of claim 12, wherein the cryptographic implementation is further configured to: encode the output data according to a selected one of a plurality of encoding schemes, wherein the encoding scheme is selected out of a plurality of encoding schemes based upon selection data which depends upon the input message; receive in a next round of the cryptographic operation the encoder output as input; and compensate for the effect of the encoding according to a selected one of a plurality of recoding schemes based upon the selected recoding scheme out of the plurality of recoding schemes based upon the selection data.
 15. The method of claim 13, wherein inputting each split input data into the non-linear mapping function includes inputting each split input data into a plurality of split mapping functions wherein XORing the plurality of split mapping functions results in the non-linear mapping function.
 16. The method of claim 12, wherein determining and remapping are associated with a current round of the keyed cryptographic operation producing an output of the current round that is the input to a next round of the keyed cryptographic operation and comprising in the next round compensating for the remapping of the output data.
 17. The method of claim 16, wherein compensating for the remapping of the output data includes receiving an indication of the input value that was remapped.
 18. The method of claim 16, wherein in the cryptographic operation is an advance encryption standard (AES) operation and wherein compensating for the remapping of the input data is based upon MC, S(0), and S(C), where MC is mix columns function, S(0) is the value of the AES substitution box for input of 0, and S(C) is the value of the AES substitution box for input of C, where the remapped output is set to the value of C.
 19. The method of claim 12, wherein determining that the input data has a diversification number less than a diversification level threshold value includes determining that the input data corresponds to a constant point associated with the non-linear mapping function and wherein remapping the input data to a remapped input data includes setting the input data to a value C.
 20. The method of claim 19, wherein in the cryptographic operation is an advance encryption standard (AES) operation and wherein the constant point is 0.
 21. The method of claim 12, wherein lookup tables implement the keyed cryptographic operation.
 22. The method of claim 12, wherein finite state machines implement keyed cryptographic operation. 